CLAIMS 



What is claimed is: 



X An apparatus for filtering documents as received, the apparatus 
comprising: 

zNtocument parser, said document parser accepting documents as input and 
providing inWted lists of terms contained in the documents as output; 

a profileWrser, the profile parser accepting user queries as input as 
providing a query n^t representing the query as output; and 

a comparator ftoit compares the inverted list for an incoming document 
against the query net representing the user query and providing as output an 
indication whether the incoming document matches the user query. 

2. The apparatus of claim 1 wterein said profile parser provides as output a 
term dictionary containing terms fronMhe user query, and wherein the document 
parser uses said term dictionary to eliminate terms from said inverted list. 

3. The apparatus of claim 1 wherein said profile parser accepts a plurality of 
user queries and stores a corresponding plurality of query nets in a memory 
element. 

4. The apparatus of claim 3 wherein said comparator compares an inverted 
list associated with an incoming document against each of said plurality of stored 
query nets. 

5. The apparatus of claim 1 wherein said comparator provides an indication 
that the incoming document matches the user query if said inverted lists contain a 
minimum number of terms contained in the query net. 
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6. The apparatus of claim 1 wherein said profile parser, said document parser, 
and said comparator reside on separate machines, said separate machines 
interconnected by a network. 

A method for filtering incoming documents, the method comprising the 
stehs of: 

. (a) receiving an incoming document and parsing it to produce an 
inverted l^st of terms contained in incoming document; 

(b) \ using the produced inverted list to retrieve query nets representing 
user queries; 

(c) di^arding retrieved query nets matching less than a predetermined 
number of terms; 

(d) scoring ^naming profile and discarding profiles having a score less 
than a predetermined thresMd. 



1 8. The metHfcd of claim; 7 further comprising the step of receiving user queries 
and parsing the user qb^ReSro produce query nets representing the query. 



1 J>? The method of claim 7 further comprising that step of storing all query nets 

2 associated with a user as a clipset. 

1 10. The method of claim 7 further comprising the step of storing an inverted 

2 list associated with an incoming document. 



1 yri" . The method of claim y further comprising the step of storing corpus 

2 statistics in the clipset. 

12. An artic^of manufacture having computer-readable program means for 
2 filtering incoming dobuments, the article comprising: 
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3 (a\ computer-readable program means for receiving an incoming 

4 document afod parsing it to produce an inverted list of terms contained in incoming 

5 document; 

6 (b) cbmputer-readable program means for using the produced inverted 

7 list to retrieve queV nets representing user queries; 

8 (c) comp\ter-readable program means for discarding retrieved query 

9 nets matching less than\ predetermined number of terms; 

10 (d) computer-readable program means for scoring remaining profile 

1 1 and discarding profiles havinW a score less than a predetermined threshold. 
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13. The artidte of manufacture of claim 12 further comprising computer- 
readable program\eans fo: ^receiving user queries and parsing the user queries to 
produce query nets re^resei Lting the query. 

\v \ if 

yt. The article of manufacture claim VI further comprising computer-readable 
means of storing all query nets associated with a user as a clipset. 
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The article of manufacture of claim Vt further comprising 
computer-readable means of storing an inverted list associated with an incoming 
document. ^ 



The article of manufacture of claimyg further comprising computer-readable 
means of storing corpus statistics in the clipset. 





